Search CORE

174 research outputs found

Clustering, Hamming Embedding, Generalized LSH and the Max Norm

Author: A. Gionis
J.L. Krivine
L. Danzer
L.V. Buchok
N. Alon
N. Srebro
Publication venue
Publication date: 01/01/2014
Field of study

We study the convex relaxation of clustering and hamming embedding, focusing on the asymmetric case (co-clustering and asymmetric hamming embedding), understanding their relationship to LSH as studied by (Charikar 2002) and to the max-norm ball, and the differences between their symmetric and asymmetric versions.Comment: 17 page

arXiv.org e-Print Archive

CiteSeerX

Crossref

Factorizing LambdaMART for cold start recommendations

Author: Alexandros Kalousis
CJ Burges
CJ Burges
D Cai
J Fürnkranz
JH Friedman
Jun Wang
M Hilario
N Srebro
Phong Nguyen
Publication venue
Publication date: 04/11/2015
Field of study

Recommendation systems often rely on point-wise loss metrics such as the mean squared error. However, in real recommendation settings only few items are presented to a user. This observation has recently encouraged the use of rank-based metrics. LambdaMART is the state-of-the-art algorithm in learning to rank which relies on such a metric. Despite its success it does not have a principled regularization mechanism relying in empirical approaches to control model complexity leaving it thus prone to overfitting. Motivated by the fact that very often the users' and items' descriptions as well as the preference behavior can be well summarized by a small number of hidden factors, we propose a novel algorithm, LambdaMART Matrix Factorization (LambdaMART-MF), that learns a low rank latent representation of users and items using gradient boosted trees. The algorithm factorizes lambdaMART by defining relevance scores as the inner product of the learned representations of the users and items. The low rank is essentially a model complexity controller; on top of it we propose additional regularizers to constraint the learned latent representations that reflect the user and item manifolds as these are defined by their original feature based descriptors and the preference behavior. Finally we also propose to use a weighted variant of NDCG to reduce the penalty for similar items with large rating discrepancy. We experiment on two very different recommendation datasets, meta-mining and movies-users, and evaluate the performance of LambdaMART-MF, with and without regularization, in the cold start setting as well as in the simpler matrix completion setting. In both cases it outperforms in a significant manner current state of the art algorithms

arXiv.org e-Print Archive

Crossref

Hes-so: ArODES Open Archive (University of Applied Sciences and Arts Western Switzerland / Haute école spécialisée de Suisse occidentale / FH Westschweiz)

Archive ouverte UNIGE

Acid/base-triggered switching of circularly polarized luminescence and electronic circular dichroism in organic and organometallic helicenes.

Author: Autschbach J.
Crassous C.
Deol K. K.
Moore II B.
Muller G.
Roussel C.
Saleh S.
Srebro M.
Toupet L.
Vanthuyne N.
Williams J. A. G.
Publication venue: 'Wiley'
Publication date: 01/01/2015
Field of study

Electronic circular dichroism and circularly polarized luminescence acid/base switching activity has been demonstrated in helicene-bipyridine proligand 1 a and in its “rollover” cycloplatinated derivative 2 a. Whereas proligand 1 a displays a strong bathochromic shift (>160 nm) of the nonpolarized and circularly polarized luminescence upon protonation, complex 2 a displays slightly stronger emission. This strikingly different behavior between singlet emission in the organic helicene and triplet emission in the organometallic derivative has been rationalized by using quantum-chemical calculations. The very large bathochromic shift of the emission observed upon protonation of azahelicene-bipyridine 1 a has been attributed to the decrease in aromaticity (promoting a charge-transfer-type transition rather than a π–π* transition) as well as an increase in the HOMO–LUMO character of the transition and stabilization of the LUMO level upon protonation

Durham Research Online

HAL AMU

PubMed Central

HAL Descartes

Jagiellonian Univeristy Repository

Hal-Diderot

HAL-Rennes 1

Fast Differentially Private Matrix Factorization

Author: Ahn S.
Chen T.
Ding N.
Hartstein A.
Keshavan R.
Kyrola A.
Marsaglia G.
Meka R.
Mir D. J.
Neal R. M.
Niu F.
Sato I.
Srebro N.
Wang Y.-X.
Wang Y.-X.
Welling M.
Xin Y.
Zhao H.
Publication venue
Publication date: 07/05/2015
Field of study

Differentially private collaborative filtering is a challenging task, both in terms of accuracy and speed. We present a simple algorithm that is provably differentially private, while offering good performance, using a novel connection of differential privacy to Bayesian posterior sampling via Stochastic Gradient Langevin Dynamics. Due to its simplicity the algorithm lends itself to efficient implementation. By careful systems design and by exploiting the power law behavior of the data to maximize CPU cache bandwidth we are able to generate 1024 dimensional models at a rate of 8.5 million recommendations per second on a single PC

arXiv.org e-Print Archive

Crossref

Acid/base-triggered switching of circularly polarized luminescence and electronic circular dichroism in organic and organometallic helicenes

Author: Autschbach J.
Crassous C.
Deol K.K.
Moore B.
Muller G.
Roussel C.
Saleh S.
Srebro M.
Toupet L.
Vanthuyne N.
Williams J.A.G.
Publication venue: Wiley
Publication date: 21/11/2014
Field of study

Durham Research Online

Regularized fitted Q-iteration: application to planning

Author: A. Antos
B. Schölkopf
D. Ernst
D. Ormoneit
D.-X. Zhou
D.P. Bertsekas
F. Bunea
L. Györfi
N. Srebro
R. Munos
S. Mannor
X. Xu
Y. Engel
Y. Engel
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2008
Field of study

We consider planning in a Markovian decision problem, i.e., the problem of finding a good policy given access to a generative model of the environment. We propose to use fitted Q-iteration with penalized (or regularized) least-squares regression as the regression subroutine to address the problem of controlling model-complexity. The algorithm is presented in detail for the case when the function space is a reproducing kernel Hilbert space underlying a user-chosen kernel function. We derive bounds on the quality of the solution and argue that data-dependent penalties can lead to almost optimal performance. A simple example is used to illustrate the benefits of using a penalized procedure

CiteSeerX

Crossref

SZTAKI Publication Repository

Socially Enabled Preference Learning from Implicit Feedback Data

Author: D. Agarwal
G. Takacs
H. Ma
H. Ma
J. Basilico
J. Noel
M. Deshpande
M. Jamali
M. Weimer
N. Srebro
N. Srebro
P. Cui
R. Salakhutdinov
S.-H. Yang
W.-J. Li
Y. Zheng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Crossref

Joint covariate selection and joint subspace selection for multiple classification problems

Author: A. Maurer
A. Torralba
B. Efron
B. Wu
Ben Taskar
F. Bach
F. Bach
F. Chiaromonte
Guillaume Obozinski
J. Khan
K.C. Li
L. Meier
M. Breukelen van
M. Yuan
M.R. Osborne
Michael I. Jordan
N. Srebro
N. Srebro
N. Srebro
N.R. Draper
P. Zhao
R. Ando
R. Tibshirani
S. Ben-David
S. Boyd
S. Rosset
T. Hastie
T. Jebara
W. Fu
Y. Kim
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Exploiting the bin-class histograms for feature selection on discrete data

Author: A Ferreira
B Franay
G Brown
G Strang
I Witten
L Kurgan
N Srebro
R Duda
S Garcia
T Cover
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

In machine learning and pattern recognition tasks, the use of feature discretization techniques may have several advantages. The discretized features may hold enough information for the learning task at hand, while ignoring minor fluctuations that are irrelevant or harmful for that task. The discretized features have more compact representations that may yield both better accuracy and lower training time, as compared to the use of the original features. However, in many cases, mainly with medium and high-dimensional data, the large number of features usually implies that there is some redundancy among them. Thus, we may further apply feature selection (FS) techniques on the discrete data, keeping the most relevant features, while discarding the irrelevant and redundant ones. In this paper, we propose relevance and redundancy criteria for supervised feature selection techniques on discrete data. These criteria are applied to the bin-class histograms of the discrete features. The experimental results, on public benchmark data, show that the proposed criteria can achieve better accuracy than widely used relevance and redundancy criteria, such as mutual information and the Fisher ratio

Repositório Científico do Instituto Politécnico de Lisboa

Crossref

Learning multi-linear representations of distributions for efficient inference

Author: A. Darwiche
A. Darwiche
D. Burdick
D. Gunopulos
D. Heckerman
D. Roth
Dan Roth
E. Castillo
F. V. Jensen
J. Pearl
J. S. Yedidia
M. J. Wainwright
M. Jaeger
M. Meila
N. L. Zhang
N. Srebro
Rajhans Samdani
W. R. Gilks
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref